Speaker, Vocabulary and Context Independent Word Spotting System for Continuous Speech
نویسندگان
چکیده
Word spotting is a widely known subject in continuous speech recognition and the traditional approaches use either hidden Markov models (HMM) or Gaussian mixture models (GMM). In this paper, we propose a different approach based on language independent phoneme modeling. The proposed system is speaker and vocabulary independent, and it is easy to implement. An equal error rate (EER) of 3.34% and a figure of merit (FOM) of 45.58% are achieved on TIMIT corpus.
منابع مشابه
Spoken Term Detection for Persian News of Islamic Republic of Iran Broadcasting
Islamic Republic of Iran Broadcasting (IRIB) as one of the biggest broadcasting organizations, produces thousands of hours of media content daily. Accordingly, the IRIBchr('39')s archive is one of the richest archives in Iran containing a huge amount of multimedia data. Monitoring this massive volume of data, and brows and retrieval of this archive is one of the key issues for this broadcasting...
متن کاملA Rejection Method for the Isolated Word Recognition System
M efficient rejection method is implemented for the HMM based small vocabulary isolated word recognition system. Six clustered phoneme models are generated using statistical method from the 45 context independent Korean phoneme models which were trained using the phonetically balanced Korean speech database and the classification through likelihood ratio scoring is performed based on the cluste...
متن کاملAcoustic Modeling of Subword Units for Large Vocabulary Speaker Independent Speech Recognition
The field of large vocabulary, continuous speech recognition has advanced to the point where there are several systems capable of attaining between 90 and 95% word accuracy for speaker independent recognition of a 1000 word vocabulary, spoken fluently for a task with a perplexity (average word branching factor) of about 60. There are several factors which account for the high performance achiev...
متن کاملA Study on Out-of-vocabulary Word Modeling for a Segment-based Keyword Spotting System
The purpose of a word spotting system is to detect a certain set of keywords in continuous speech. The most common approach consists of models of the keywords augmented with \ ller," or \garbage" models, that are trained to account for non-keyword speech and background noise. Another approach is to use a large vocabulary continuous speech recognition system (LVCSR) to produce the most likely hy...
متن کاملContinuous speech dictation in French
A major research activity at LIMSI is multilingual, speaker-independent, large vocabulary speech dictation. In this paper we report on efforts in large vocabulary, speaker-independent continuous speech recognition of French using the BREF corpus. Recognition experiments were carried out with vocabularies containing up to 20k words. The recognizer makes use of continuous density HMM with Gaussia...
متن کامل